Picture for Yixiao Huang

Yixiao Huang

Transformers Provably Learn to Internalize Chain-of-Thought

Add code
May 27, 2026
Viaarxiv icon

Multi-Objective Learning for Diffusion Models: A Statistical Theory under Semi-Supervised Learning

Add code
May 24, 2026
Viaarxiv icon

AutoResearch AI: Towards AI-Powered Research Automation for Scientific Discovery

Add code
May 22, 2026
Viaarxiv icon

Breaking the Reversal Curse in Autoregressive Language Models via Identity Bridge

Add code
Feb 02, 2026
Viaarxiv icon

Generalization or Hallucination? Understanding Out-of-Context Reasoning in Transformers

Add code
Jun 12, 2025
Viaarxiv icon

OVERT: A Benchmark for Over-Refusal Evaluation on Text-to-Image Models

Add code
May 28, 2025
Viaarxiv icon

Fast Adversarial Training against Sparse Attacks Requires Loss Smoothing

Add code
Feb 28, 2025
Figure 1 for Fast Adversarial Training against Sparse Attacks Requires Loss Smoothing
Figure 2 for Fast Adversarial Training against Sparse Attacks Requires Loss Smoothing
Figure 3 for Fast Adversarial Training against Sparse Attacks Requires Loss Smoothing
Figure 4 for Fast Adversarial Training against Sparse Attacks Requires Loss Smoothing
Viaarxiv icon

On the Power of Convolution Augmented Transformer

Add code
Jul 08, 2024
Viaarxiv icon

Towards Efficient Training and Evaluation of Robust Models against $l_0$ Bounded Adversarial Perturbations

Add code
May 08, 2024
Viaarxiv icon

Mechanics of Next Token Prediction with Self-Attention

Add code
Mar 12, 2024
Figure 1 for Mechanics of Next Token Prediction with Self-Attention
Figure 2 for Mechanics of Next Token Prediction with Self-Attention
Figure 3 for Mechanics of Next Token Prediction with Self-Attention
Figure 4 for Mechanics of Next Token Prediction with Self-Attention
Viaarxiv icon